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ABSTRACT 

This paper investigates weighted approximations for studentized U- 
statistics type processes, both with symmetric and antisymmetric 
kernels, only under the assumption that the distribution of the 
projection variate is in the domain of attraction of the normal law. 
The classical second moment condition E\h(Xi, X 2 )\ 2 < oo is also 
relaxed in both cases. The results can be used for testing the null 
assumption of having a random sample versus the alternative that 
there is a change in distribution in the sequence. 
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1 Introduction and main results: the case of sym- 
metric kernels 



Let X, X 1 ,X 2 , ... be a sequence of non-degenerate independent real- valued random vari- 
ables with distribution function F. Suppose we are interested in testing the null hypoth- 
esis: 

Hq : Xi, 1 < i < n, have the same distribution, 
against the one change in distribution alternative: 

Ha : there is an integer k, 1 < k < n, such that 

P{X 1 <*) = ... = P(X k < t), P(X k+1 <*) = ... = P(X n < t) 
for all t and P(X k < t ) ^ P(X k +i < t ) for some to- 
Testing for this kind of a change in distribution has been studied extensively in the 
literature by using parametric as well as non-parametric methods. One of the non- 
parametric methods was proposed by Csorgo and Horvath (1988a, b), who used func- 
tional of a [/-statistics type ([/-type, from now on) process to test H against Ha- Let 
h(x, y) be a measurable real valued symmetric function, i.e. h(x, y) = h(y, x). The [/-type 
process of Csorgo and Horvath (1988a, b) is defined by 

U n (t) = Z [(n+1)t] - n 2 t{\ - t)9, < t < 1, 
where 9 = Eh(X 1 ,X 2 ), and 

k n 

4 = EE h(Xi,Xj), l<k<n. 

i=l j=k+l 

While Z k itself is not a [/-statistic, it can be written as the sums of three [/-statistics 
[cf. Csorgo and Horvath (1988a, b, 1997)]. The rational behind the definition of Z k is 
comparing the first k observations to the remaining (n — k) ones for k = 1, . . . , n — 1, via 
an appropriate bivariate kernel function h(x, y) for the sake of capturing the possibility 
of having a change in distribution at an unknown time k as postulated in Ha- Typical 
choices of symmetric kernel h are xy, (x — y) 2 /2 (the sample variance), \x — y\ (Gini's 
mean difference), and sign(a: + y) (Wilcoxon's one-sample statistic). 

Throughout the paper, we write g(t) = E (h(X,t) — 6) , a 2 = Eg 2 (Xi) and, for later 
use, we define a Gaussian process T by 

r(t) = (l - *) w(t) + t [w{\) - w{t)\ , o < t < i, (i) 

where {W(t),0 < t < oo} is a standard Wiener process. Furthermore, let Q be the 
class of positive functions q on (0,1), i.e., mf$< t <i-$ q(t) > for < 5 < 1, which are 
nondecreasing in a neighbourhood of zero and nonincreasing in a neighbourhood of one, 
and let 

J(g ' c) = L W^T) exp ( " t(TTt) ) dt > < c < °°- 



l 



In terms of these notations, Csorgo and Horvath (1988a, b), Szyszkowicz (1991, 1992) 
established the following result [cf. Theorem 2.4.2 in Csorgo and Horvath (1997)]. 

Theorem A Assume H , < a 2 < oo and E\h(Xi, X 2 )\ 2 < oo. Then, on an appropri- 
ate probability space for X,X ± ,X 2 , ■ ■ ■ , we can define a sequence of Gaussian processes 
{r n (t),0 < t < 1} such that the equality in distribution 

{T n (t),0<t<l}= d {T(t),0<t<l} (2) 

holds for each n > 1, and as n — > oo, 



sup 

0<«1 



n 



-3/2, 



a- l U n (t)-T n (t) /q(t) = o P (l). (3) 



if and only if I(q, c) < oo for all c> 0. 

Remark 1 The condition E\h(X 1 , X 2 )\ 2 < oo implies that Eg 2 (Xi) < oo, and we 
assume that a 2 = Eg 2 (X 1 ) > 0. This is the so-called non-degenerate case when studying 
[/-statistics via the function g(t) = E(h(X,t) — 9) that induces the projection of U- 
statistics into sums of i.i.d. random variables, the so-called Hoeffding (1948) projection 
principle that, in part, rests on a paper of Halmos (1946). 

For functions x, y in D[0, 1] and q G Q, we define the weighted sup-norm metric \ \/q\\ 

by 

\\{x-y)/q\\ = sup \(x(t)-y(t))/q(t)\, 

0<t<l 

whenever this is well defined, i.e., when lim sup \(x(t) — y(t))/q(t)\ is finite for t I and 
t t 1. 

In view of (2) and this terminology, (3) of Theorem A implies the following weak 
convergence, a functional limit theorem. 

Corollary A With q G Q, and standing for convergence in distribution as n — > oo, 
we have 

h{n-^a-'U n {.)/q{.)} ^ d h{T(.)/q(-)} 

for all h : D = D[0, 1] — > M that are (D,T>) measurable and \ \/q\\- continuous, or \ \/q\\- 
continuous except at points forming a set of measure zero on (D,T>) with respect to the 
measure generated by the Gaussian T(-) process, if and only if I(q, c) < oo for all c > 0, 
where V denotes the a -field of subsets of D generated by the finite dimensional subsets of 
D. 

Remark A For further use the statement of Corollary A will be summarized by writing, 

as n — > oo, 

n- 3 V[/ n (-)/g(.)4r(.)/ g (.) on (D[0, 1], V, \\/q\\). 

For a summary of notions of convergence and weak convergence in general along these 
lines, we refer to pages 26-28 and Remarks 2 and 3 on page 49 of Shorack and Wellner 
(1986), and to Sections 3.3 and 3.4 of Csorgo (2002). 
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Thus Theorem A provides a basic tool for investigating the asymptotic behaviour of 
many test statistics for testing Hq versus Ha via corresponding functional of T(-)/q(-) 
for appropriate choices of the kernel h(x, y). This, in turn, motivates the establishment of 
our first result, in which we reduce the moment conditions related to the kernel h(x,y). 
It reads as follows. 

Theorem 1 Assume H , < a 2 < oo and E\h(X 1: X 2 )\^ 3 < oo. Then, on an appro- 
priate probability space for X, Xi, X 2 , • • • , we can define a sequence of Gaussian processes 
{T n (t),0 < t < 1} such that (2) holds true, and if I(q,c) < oo for some c > 0, then as 
n — > oo, 



sup 

l/n<t<(n-l)/n 



n 



-* /2 a- l U n {t) - F n (t) /q(t) = o P (l). (4) 



In addition to reducing the moment conditions required in Theorem A, the result (4) 
of Theorem 1 generalizes (3) as well. Namely, as a direct consequence of Theorem 1, we 
have the following corollary. 

Corollary 1 Assume H , < a 2 < oo and E\h(X u X 2 )| 4/3 < oo. If q E Q, then 

(a) we still have the conclusion of Theorem A, i.e., (3) holds true if and only if 
I(q, c) < oo for all c > 0; 

(b) as n — > oo ; 



n 



-^a^U n {.)/q{.) =► r(.)/?(0 on (£>[0, 1], V, \\/q\\) (5) 



if and only if I(q, c) < oo for all c > 0; 
(c) as n — > oo ; 

n-^V" 1 sup \U n (t)\/q(t) ^ d sup \T(t)\/q(t) (6) 

o<t<i ' 0<t<l 

if and only if I(q, c) < oo for some c > 0. 

We note in passing that (a) implies (b), just like (3) implies Corollary A (cf. (a) of 
Lemma 3). However, (a) does not imply (c) (cf. (b) of Lemma 3). 

In view of the definition of Z^, and hence also that of U n (t), when 9 and a are known, 
large values of the statistic on the left hand sides in (6) for example, indicate a change 
in the distribution, and hence, based on Corollary 1, rejection of H can be quantified 
accordingly. Otherwise 6 and a need to be estimated. A natural estimate of 9 is 

and that of a 2 is 

in -i n „ 

n j=l U j=l 
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According to the definition of g(x), g(Xj) still depends on the usually unknown distribu- 
tion function F of X, and hence it then can not be computed explicitly. Since we have 
that g(x) + 9 = J h(x,y)dF(y), we can replace F by the empirical distribution function 
F n of Xi,X 2 , ■■■jXn under H . Consequently, we may for example estimate a 2 by 

1 n 1 n 1 



3 

We note that this estimate is in fact the jackknife estimator of Var(9). Now we may 
introduce a studentized U-type process as follows: 

U n {t) = n^ia)- 1 (Z [(n+1)t] - n 2 t(l - t)§) , < t < 1. 

This process does not depend on the unknown parameters 9 and a and we now state the 
following main result of this paper, in which we replace the assumption that < a 2 < oo 
by assuming only that g(Xi) is in the domain of attraction of the normal law, written 
g{X 1 ) G DAN throughout. 

Theorem 2 Let q G Q. Assume H , E\h(X 1 , X 2 )\ 5/3 < oo and that g(X x ) G DAN. 
Then, on an appropriate probability space for X, Xi, X 2 , ■ ■ ■ , we can define a sequence of 
Gaussian processes {T n (t),0 < t < 1} such that (2) holds true and, as n — > oo ; 

sup U n {t)-Y n {t) /q(t) = o P (l), (7) 
o<«i ' 

if and only if I(q, c) < oo for all c > 0. Consequently, as n — > oo ; 

tf»(0M0 =► r(-) /?(•), on(D[0,l],P,||/g||) (8) 
if and only if I(q, c) < oo for all c > 0. Furthermore, as n — > oo ; we a/so /iawe 

sup |C/ n (i)|/g(*) ^ d sup |r(t)|/g(t) (9) 

0<t<l ' 0<t<l 



i/ and on/j/ i/ /(g, c) < oo /or some c > 0. 



Remark 2 It is interesting and also of interest to note that the class of the weight 
functions in (9) is bigger than that in (8) [also compare (6) with (5)]. Such a phenomenon 
was first noticed and proved for weighted empirical and quantile processes by Csorgo, 
Csorgo, Horvath and Mason [CsCsHM] (1986) and then by Csorgo and Horvath (1988b) 
for partial sums on assuming E , |A A |*' < oo for some v > 2. For more details along these 
lines, we refer to Szyszkowicz (1991, 1996, 1997), and to Csorgo, Norvaisa and Szyszkowicz 
(1999). 

Remark 3 As we noted already in Remark 1, the condition that < o 2 = Eg 2 (Xi) < oo 
is the so-called non-degenerate case when studying U— statistics. In Theorem 1 it is a 
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necessary condition, while assuming E\h(Xi, X 2 )| 4//3 < oo is close to being necessary, on 
account of the central limit theorem for [/-statistics (see Borovskikh (2002), for example). 
Theorem 2 puts a totally new countenance on the classical theory of weak convergence 
for standardized {/-type process as in Theorem 1 [cf. also Theorem A, Section 2.2.4 of 
Csorgo and Horvath (1997), Gombay and Horvath (1995, 2002)] in that here we derive 
results assuming only g(Xi) G DAN and, consequently, we may have a 2 = Eg 2 (Xi) = oo. 
The price we pay for this is the somewhat higher moment condition E\h(Xi, X 2 )| 5//3 < oo 
than that of the corresponding one with exponent 4/3 in Theorem 1. What is crucial in 
Theorem 2 in this regard is that the existence of the second moment of h{X\,X-2) is not 
assumed, for assuming the latter would exclude the possibility of having a 2 = Eg 2 (X\) = 
oo (cf. Remark 1). 

This paper is organized as follows. In the next section we provide the proofs of main 
results. Then, in Section 3, we investigate the asymptotic behaviour of the {/—type 
process U n (.) when it is based on kernels that are antisymmetric, i.e., h(x,y) in such that 
h(x,y) = —h(y,x). Throughout the paper A,Ai, ... will denote constants which may be 
different in each appearance. 

2 Proofs of main results 

We need some preliminaries to proving our main theorems. The following lemma consti- 
tutes the key step. We note in passing that the three basic relations (11), (12), (13) of 
Lemma 1 are of interest on their own in studying U-statistics type processes, indepen- 
dently of their kernel function h(-, •) being symmetric, or antisymmetric. 

Lemma 1 Let ip(x,y) be a measurable real valued symmetric function for which we have 

J^(x,y)dF(y) = (10) 
and E\ip(X 1 ,X 2 )\^ 3 < oo. Then, as n — »■ oo ; 

j k n 

- max ^- 1/2 |E E 1>iXuXi)\ = O p (1), (11) 
n - - 1 i=i j=k+i 

- max (n-*)- 1/2 |E E = O p (1), (12) 

i=l j=k+l 

^^SJ% t *x ( , Xl )\ = o,(l). (13) 

_ _ t=l j=k+l 

Proof We only prove (11) and (13). By virtue of the symmetry of ip(x,y) and the 
i.i.d. properties of X i: the proof of (12) is similar to that of (11). We omit the latter 
details. 
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In order to prove (11), write 

il>*(x,y) = ii(x,y)I{\^ x , y )\<i3/2} - J J ip(u,v)I {mu>v)l < i3 /2 } dF(u)dF(v), 

g *(x) = J ip*(x, y)dF{y) and ip**(x, y) = ip*(x, y) - g*(x) - g*(y). 
It is readily seen that 

E[rj}*{Xi, Xj)\ = and E[^**(X U Xj) | X t ] = 0, for all i ^ j. (14) 

Having E^X^Xj) \ JQ] = by (10), we also have 

g*(Xi) = E\r{X i ,X j )\X l ] 

= E^X^X^I^^^ | X^-E^X^X^I^^]. (15) 

We now turn to the proof of (11). We have 

i k n 

- max ^- 1/2 |E E 1>(Xi,Xj) < h(n) + I 2 (n) + I 3 (n), (16) 

nl<k<n-l 



n 

where 

3 = 1 

i k n 



1 3 = 1 

Y k n 



h(n) = - max k' 1 ' 2 ]T £ MX^Xj) -P(Xi,Xj) + 9 m (*i) + 9 m (Xj)) 
n - - 1 i=i j=k+i 

We next prove It(n) = Op(l) for t — 1, 2, 3 and then (11) follows accordingly. 
First consider t = 1. Write y = £}=\ -Xj). Note that £(Y;Y fe ) = for aRi^k 

by (14). It is readily seen that 

£|£r 3 / 2 y| = ^r^y^^^r 2 ^ 2 ^,^)/^/,] 

i=2 i=2 i=2 

oo oo 

< A E{ip 2 (X 1 ,X 2 )I( k „i ) 3/2 <w < k 3/2] J2i~ 2 

k=l i=k 

< AE\i/j(X 1 ,X 2 )\ 4 / 3 < oo. (17) 

This, together with the Kronecker lemma, implies that k~ 3 l 2 J2i=i Yi 0, a.s., and hence 
h(n) = O p (1), since h(n) < 2 maxi< fe <„„i k~ 3 / 2 \ Ei=2 Y i\- 
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Secondly we prove I 2 (n) = O p (1). Write Z in = X)?=i i>**(Xi,Xj). By noting that, for 



any ai and k > 1, | X^=i &i = i>k — \ Yh=1 where 6j = X)t=i °t A> it follows that 



1 v^fc-i 



-^2 (w) < — 777T max /c 1 1 Z 



< 



max 



n 1 / 2 Kk<n-l'" |^~ m ~ n i/2 k'k^' 
Therefore, it only needs to be shown that, uniformly in n > 1, 

< A < oo. 



i=i 6 



(18) 



n 



1 I 00 1 

E\ E ~Zin 



1/2 



i=l 



(19) 



Indeed, the result (19) implies that -^2 HiLi 

0, a.s., as A; — > oo, uniformly in n > 1. This, toget 



< A < oo a.s., and ^ E^ fc 
rer with (18), yields 



—-pr max 

n 1 ' 2 l<k<N 



1 2 i 00 1 

]T-Z m <2A + max^ E- Z « =°p( 1 )- 

r— f z fc>l n 1 ^ 1 r-f 2 



i=l 



The proof of (19) follows from a similar argument as in the proof of (17). In fact, for all 
n > 1, we have 



1 I 00 1 

■ E\ E -Zin 



nV2 



i=l 



< 



1 i °° 1 

n i72-[^|E-^ 

«=1 



2n 1/2 
2n 1/2 



00 1 1/2 

< A[J2-E^(X 1 ,X 2 )I m<i3/ , ) ] 

i=i z 

< A[^(X 1 ,X 2 )M 1 / 2 < 00 , 



which yields (19). 

Finally we prove h(n) = Op(l). Recalling (15) and Eip(Xi, X 2 ) = 0, we have 
Kj := \^{X l) X j )-r{X t) X 1 )+g*{X l )+g*{X j )\ 

+ £?[|V(X i ,X J -)|%> i 3/2 ) |X j ] + ^^(X^X^/^,^)]. 

This implies that E(A i:j ) < AE[\^p(X 1 , X 2 )\I^ >i3 /2- ) , and hence 

i k n 



Eh{n) < -E max f'^^Ay 

71 L Kfe<n— 1 r ; 



J oo 



i=l j'=i 



< ^E^ 1/2 E^(a, 



n 



1=1 3 = 1 
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OO 1 



i=i 

OO 



< 



E - E tlV ; (^l,^2)|-?'(fc3/2 < | V ,|< {fc+1) 3/2 ) ] E^ 



k=l 



i=l 



< A E\^j{X 1 , X 2 )\ 4 / 3 <oo, 



(20) 



uniformly for all n > 1. By Markov's inequality, we obtain I^{n) = Op(l). The proof of 
(11) is now complete. 

The proof of (13) is similar to that of (11), but we have to use a different truncation. 
In the following, we let 



tp*(x,y) = tl){x,y)I mXty) \< n3/ 2 } - J J %l)(u,v)I m ^ v) \< n z/2 } dF{u)dF{v), 
g*{x) = J if)*(x,y)dF(y) and ip**(x,y) = ip*(x,y) — g*{x) — g*(y). It follows easily that 



k n 



n 3 / 2 l<k<n 

where Iq(u) 



max EE ^( X ^ X o) 



1 

:i '2 



1 j=fc+l 



< - 

- 2 



7 » + / 1 » + / 2 » + /*(n), (21) 



2 = 1 



/ 3 » 



1 



k k 



1=1 2^» 
2 = 1 



n n 



•V 2 Kk<n 



™^_J E E i>"(x»xs) 



i=k+l 2 = fc + l 

27^ 



^ .max : I E E MXuXA-PiXuXA+g'iXd+g'iXj)) 

- - i=l j=k+l 



It is readily seen that 

£[/ »f < An- 1 E^ 2 (X 1 ,X 2 )I ]mn3/2 

< A [e- 2 n^ 3 E\i>{X u X 2 ) | 4/3 + E\ip(X 1 , X 2 ) | 4/3 / H > re j 
— > 0, as n — > 00. 

This yields Iq(u) = op(l). Noting that {J2j=2 ^k, 2 < k < n} is a martingale, where 

ij = X)ti -^j) an d Fk — °"{^i> •••j-^'fc}) ^ follows from the well-known Maximum 

inequality for martingales that, for any e > 0, 

k n 

P(/*(n) > e) < 4e~ 2 n~ 3 E max | E Yjf < Ae~ 2 n~ 3 E £lf 



i=2 



i=2 



< A ' 
— > 0, as n — > 00. 



e- 2 n" 1 ^ X 2 ) | 4/3 + X 2 ) | 4/3 / H 



>n 
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This yields I*(n) = op(l). By a similar argument as in the proof for J*(n) = Op(l), we 
have I%(n) = op(l). As for I|(n), by using a similar argument as in the proof of (20), we 
obtain 

inn 

Ai 1=1 2 = 1 

< 4n 1 / 2 J E[|^(X 1 ,X 2 )|/ | ^ | > ri a /2 ] 

< 4E[|^(X 1 ,X 2 )| 4 % | > n3/2 ]-,0, 

as n — > oo, which implies that I|(n) = op(l). Taking all the respective estimates for 
If(n),t = 0, 1,2,3 into (21), we obtain the required (13). The proof of Lemma 1 is now 
complete. 

The next two lemmas are due to CsCsHM (1986) [cf. Lemma A. 5.1 and Theorem 
A. 5.1 respectively in Csogo and Horvath (1997)]. Proofs of Lemmas 2 and 3 can also be 
found in Section 4.1 of Csorgo and Horvath (1993). 

Lemma 2 Let q(t) G Q. If I(q, c) < oo for some c > 0, then 

lim t 1/2 /q(t) = and lim (1 - t) 1/2 /q(t) = 0. 

Lemma 3 Let {W(t), < t < oo} fre a standard Wiener process and q(t) G Q. Then, 

(a) /(g, c) < oo for all c> i/ and on/y i/ 

limsup \W(t)\/q(t) = 0, a.s. and limsup \W(1) - W{t)\/q(t) = 0, a.s. 
tio tii 

(b) I(q, c) < oo /or some c > if and only if 

limsup \W(t)\/q(t) < oo, a.s. and limsup \W(l) - W(t)\/q(t) < oo, a.s. 
tio tii 

We are now ready to prove our main theorems. 

Proof of Theorem 1. Together with the notation as in Section 1, we write ip(x, y) = 
h(x, y)-9 - g(x) - g(y) and T n (t) = W [{n+l)t] ,0 <t<l, where 

k n 
j=l j=k+l 

Noting that g{Xj) are i.i.d. random variables with Eg(Xi) = and a 2 = Eg 2 (Xi) < oo, 
along the lines of the proof of (2.1.45) in Csorgo and Horvath (1997), on an appropri- 
ate probability space for X, X ± , X 2 , • • • we can define a sequence of Gaussian processes 
{r„(t), < t < 1} such that, for each n > 1, 

{r n (*),o<t<i}= d {r(t),o <*<i}, 
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and if q G Q and I(q, c) < oo for some c > 0, then, as n — > oo, 

sup n- 3 /V 4 T„(i)-r„(t) /g(f) = o P (l). 

l/n<t<(n-l)/n 

By virtue of (22), Theorem 1 will follow if we prove 



sup 

l/n<t<(n-l)/n 



n 



-*/*U n (t) - n-V 2 T n (t) /q (t) = o P (l). 



(22) 



(23) 



In order to prove (23), write V n (t) = Wf (n+1)t] , where W£ = £* =1 E] =k+ i ip(X h Xj) 
Note that E($(X U X 2 ) \ X x ) = E(iP(X 1 ,X 2 ) \ X 2 ) = and 

£|^(X 1? X 2 )| 4 / 3 < AE\h(X 1 ,X 2 )\ 4 / 3 < oo. 

It follows from (13) that 



sup \n- 3 / 2 V n (t)\/q(t) 

5<t<l-S ' 



k n 



~ i<T<n-i S E jup g- x (t) = P (1), 

n 1 i=lj=k+l 5<t<l-5 



for any 5 G (0, 1) and q e Q. Let 5 > be so small that q(t) is already nondecreasing on 
(0, S) and nonincreasing on (1 — 5, 1) and let n be so large such that 1/n < 5. It follows 
from (11) and Lemma 2 that 



:= sup |n- 3 /V n (t)|A(t) 

0<t<<5 



k n 



< - max AT 1 ' 2 £ £ sup t 1 / 2 /^) = o P (l), 



when n — > oo and then 5^0. Similarly, we have also 
Jf := sup \n- 3 ' 2 V n (t)\/q(t) 

1— <5<i<l ' 



I fc ri 

< - max (n-k)- 1 ^ £ VPQ,^)| sup (1 - tf 2 /q(t) 

Ul<k<n-1 l i=l j=k+ l 1-S<t<l 

= o P (l), 

when n — > oo and then <5 — * 0. By virtue of these estimates, it is readily seen that 

< + Ji 2) + Ji 3) + An- l/2 sup l/g(i) = o P (l), (24) 

l/n<t<(n-l)/n 



which yields (23). The proof of Theorem 1 is now complete. 



10 



Proof of Corollary 1. Having Theorem 1, Lemmas 2-3 and the result (23), the 
proof of Corollary 1 is the same as that given in the proof of Theorem 2.4.2 in Csorgo 
and Horvath (1997), and hence the details are omitted. 

Proof of Theorem 2. We first prove (7). It is readily seen that 

u n (t) = n-'/^ay^z^^-nHii-m + tii-ty/'i&y^e-e) 

= Phf™ } 172 n ~'{ 1 9 2 {X 3 )}- V2 U n {t) + t(l - t)n^{a)-\e - 9). 
I na ) j=i 

(25) 

Furthermore U n (t) = T n (t) + V n (t), where T n (t) and V n (t) are defined as in the proof of 
Theorem 1. Recalling that g(X\) is in the domain of attraction of the normal law, as in 
the proof of Theorem 5.2 of Csorgo, Szyszkowicz and Wang [CsSzW] (2004) with minor 
modifications, we have that on an appropriate probability space for X, Xi, X 2 , ■ ■ ■ , we can 
define a sequence of Gaussian processes {T n (t),0 < t < 1} such that (2) holds true, and 
as n — > oo, 



sup 

0<t<l 



" 1 {Y,9 2 {X j )Y ll2 T n {t)-T n {t)\/q{t) = o P (l), 



if and only if I(q, c) < oo for all c > 0. Therefore, to prove (7), it suffices to show that 

ra-^Xypg}" sup \V n (t)\/q(t) = op(l), (26) 
L j=1 J 0<t<l 

{n-^g^X,)}- 1 ^-! = op(l), (27) 

3=1 

and 

nV2(£)-i(0_0) = 0p (i). (28) 

The proof of (26) is simple and in fact (26) holds true if q(x) satisfies I(q, c) < oo 
for some c > 0. Indeed, since g(Xi) is in the domain of attraction of the normal 
law, we have Z)"=i g 2 {Xj) — *p 1, where b n = nl(n) with that l(n) = Eg 2 (Xi) if 
Eg 2 (X\) < oo or l{n) — > oo if -E^ 2 (Xi) = oo. On the other hand, as in the proof of (23), 
rT z l 2 sup 0<t<1 \V n (t)\/q(t) = Op(l) even when q{x) satisfies I(q,c) < oo for some c > 0, 
and hence (26) follows immediately from these facts. 

We next prove (27). The claim (28) follows by using (27), and hence the details are 
omitted. Without loss of generality, we assume 9 — 0. We may rewrite a 2 as 

2 _ ' \ " U(V V\UIV V\ i ' S^h2fV V\ Q2 



a = 



J2 h(X t ,X J )h(X t ,X k ) + - J2h 2 (X t ,X J )-9 2 



■ = W nl + W n2 -9 2 . 
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Recalling E\h(X±, X 2 )\ 5 ^ 3 < oo, it follows from a Marcinkiewicz type strong law for U- 
statistics that W n2 — 6* 2 ^ 0, a.s. [see Gine and Zinn (1992), for example]. Therefore (27) 
will follow if we prove 

{n^fypgJ'Vnx-l = o P (l). (29) 

Write, for % ^ j ^ k, 

hff = /i(^,^.)/ ( |,|< n 6/ B) , = E{h { $\Xi), 

i>ijk = hff h\f - Eh \f h\f , 

<p? = E(^ k \X t ), iff = E(if> ijk \Xi), V f ] = E(xjj ijk \X k ). 

Noting that E{hff hf k ] \Xi} = {g (1 \Xi)}\ it is readily seen that ipW = {g^(Xi)} 2 - 

E[hff fcg>],and 

E ^ ^ = E I'm + E ^ M?] 

i^j^k i+d+k i^j^k 

= E {9 {1) (Xi)} 2 + E {^ 2) + ^ 3) } 

+ E (^-v^-v^-y^) 

In the next paragraph, we will show that 

n , 

{n-^g'iXj)}' (n- 3 Ki)-l = o P (l), (30) 
n- 3 (K 2 + K 3 ) = o P (l). (31) 

It follows from (30) and (31) that 

n , 

{n-'EAXj)}- n" 3 E - 1 = Ml), (32) 

i=l i+j+k 



and then (29) follows from (32) and 



p \ E E C fc * ^ n 2 p(iM^i,^ 2 )i>n 6 / 5 ) 

< E\h(X 1 ,X 2 )\ 5 / 3 I lhl ^ ne/5 ^0. 
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We are to prove (30) and (31) now. Consider (30) first. By noting that g^(Xi) = 
g{X x ) - g*(Xj), where g*(Xj) = E{h(X 1 ,X 2 )I m > n ^ ) \X l }, we have 



n n 

E[{5 (1) (^)} 2 -/W]| < E[%(^-)lbWI + b*(^ 



3=1 



3=1 



< 2[E^ 2 (^)] 1/2 [E{^ra} 



1/2 



i=i 



Now, since g(Xi) is in the domain of attraction of the normal law [which implies that 
\YJj=i 9" \X j) C > 0, where C may be oo], simple calculations show that (30) will 
follow if we prove 



1 



n 



(33) 



j'=i 



In fact, for any e > 0, we have 

n n 

^(EM^)} 2 >^) < ^-^E^I/TO 



3=1 



3=1 



< e - 1 / 2 n 1 / 2 E|/ i (x 1 ,x 2 )|/ (l , l > n6/5) 

< e- 1 /2 jE | /l(Xl!X2) |5/3 /(|A| ^ 6/5) ^ 0) 

as n — > oo. This implies (33) and hence completes the proof of (30). 

We next prove (31). By noting that n~ 3 V n3 is a degenerate [/-statistic of order 3, it 
follows from moment inequality for degenerate [/-statistics (see, Borovskikh (1996), for 
example) that, for any e > 0, 



P(\V n ,\>en 3 ) < e-^n- 5 E\V n ,\ 5/3 



^123 - - v4 2) - v4 3) 



5/3 



< Ae- 5/3 n~ 2 E 

< Ae-^n-'ElhiX^X^I^^ 

< Ae^ 3 {n-^ + E\h(X u X 2 )\^I m > nl/2 

as n — > oo. On the other hand, by noting that 



0, 



(34) 



E{E[h® h$\X 2 ]} 2 = E{h<8h$E 



a 42 a A5 1^2 



} 



< 



Eh 2 (X 1 ,X 2 )I mn6/5 ] 2 < {E\h(X 1: X 2 )\ 5 / 3 Y 



it is readily seen that, for any e > 0, 



PdKsl > en 6 ) < e-' z E{n- 6 V. r 



'n2 
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< Ae- 2 n 



2 „-l 



E{E(h$hU\X 2 )}\(E{h$}y 



< Ae- 2 n- 1 / 5 {E\h(X 1 ,X 2 )\ 5 / 3 } 2 -> 0, 



(35) 



as n — > oo. By virtue of (34) and (35), we obtain (31). The proof of (7) is now complete. 

The result (8) is a direct consequence of (7). As for (9), by virtue of (25)-(28) (recalling 
that (26) still holds true for q(x) satisfying I(q, c) < oo for some c > 0, as explained in 
its proof), it suffices to show that 



sup 

0<t<l 



n 



- 1 {Y.9\X ] )Y 1,2 T n {t) 



> d sup \T(t)\/q(t) 

0<t<l 



(36) 



if and only if I(q, c) < oo for some c > 0, where T n (t) = W[( n+ i) t ],0 <t<l, with 



W k = (n-k) , £ t g(X j )+k £ 9(*i 

3=1 j=k+l 



This follows from the same arguments as in the proof of Corollary 5.2 in CsSzW (2004), 
and hence the details are omitted. This also completes the proof of Theorem 2. 



3 Antisymmetric kernel 

In this section we consider the asymptotics of [/-type processes with antisymmetric kernel 
h(x,y), i.e., h(x,y) = —h(y,x). This kind of kernels can not be symmetrized, but they 
are especially useful to check the equality of distributions for different groups of random 
variables since 9 = Eh(Xi, X 2 ) = whenever Xi = d X 2 , if E\h(Xi, X 2 )\ < oo. Conse- 
quently, for antisymmetric kernels, U n (t) = Z[( n+1 ) t ] under H Q .. An example is given in 
Pettitt (1979), who used functions of the Mann- Whitney type statistics 

(12)^- 3 / 2 E E sign 

l<i<nt nt<j<n 

to detect possible changes in distribution. Another important example is given by taking 
H(x,y) = x — y for studying the probable error of a change in a mean. We will say more 
about that in Remark 5. 

For the anti-symmetric kernel h(x,y), by letting g(t) = Eh(X 1 ,t), i.e., keeping our 
earlier notation with 9 = 0, we may write 

k n k h 11 

4 = EE ii(X i ,X j )+n[j29(Xi)--T,9(Xi)], 

i=l j=k+l i=l U i=l 

where ip(x, y) = h(x, y) + g(x) — g(y) with 

E [^{X U X 2 ) \X 1 ] = E mX,,X 2 ) | X 2 ] = 0. 
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Since Lemma 1 does not depend on the symmetry of the kernel, similarly to the 
proofs of Theorems 1 and 2, we have the following results for [/-type processes with 
antisymmetric kernel h(x,y), which improve and generalize the similar earlier results of 
Csorgo and Horvath (1988a, b), Szyszkowicz (1991, 1992) and those given in Section 2.4 
of Csorgo and Horvath (1997) along these lines. It is interesting to note that the Gaussian 
limit process that is shared by Theorems 1 and 2 and that shared by Theorems 3 and 4 
are different, although they are of equal variance. For further related results, we refer to 
Janson and Wichura (1983), and Gombay (2000a, b, 2001, 2004). 

We continue to use the notations introduced in Section 1, but U n (t) and U n {t) are now 
defined in terms of antisymmetric kernel h(x,y) = —h(y,x). Consequently, under Hq, 9 
and 9 are both zero now. 

Theorem 3 Let q E Q. Assume H , < a 2 < oo and E\h(X 1 , X 2 )| 4/3 < oo. Then, on 
an appropriate probability space for X, X\, X 2 , • • ■ , we can define a sequence of Brownian 
bridges {B n (t), < t < 1} such that if I(q, c) < 00 for some c > 0, then as n — > oo, 



sup 

l/n<t<(n-l)/n 



n 



-*/*a-iU n (t) - B n (t) \/q(t) = o P (l). (37) 



Consequently, 

(a) as n — > oo, 



sup {n-^a^Unit) - B n (t)\/q(t) = o P {\) (38) 

0<t<l 



if and only if I(q, c) < 00 for all c > 0; 
(b) as n — > oo, 



n^/V-UO /?(•) =► B{.) /q(-) on (D[0, l],V,\\/q\\) (39) 

if and only if I(q, c) < 00 for all c > 0; 
(c) as n — > 00, 

n-^a- 1 sup \U n (t)\/q(t) ^ d sup \B(t)\/q(t) (40) 

0<t<l ' 0<t<l 

if and only if I(q,c) < 00 for some c > 0, where, in (b) and (c), {B(t),0 < t < 1} is a 
Brownian bridge. 

Theorem 3 is to be compared to Szyszkowicz (1991, Theorem 2.1) [cf. Theorem 2.4.1 
in Csorgo and Horvath (1997)]. 

Theorem 4 Let q e Q. Assume H , E\h{X l) X 2 )\ b ^ < 00 and that g(X 1 ) E DAN. 
Then, on an appropriate probability space for X, Xi, X2, • • ■ , we can define a sequence of 
Brownian bridges {B n (t),0 < t < 1} such that, as n — > 00, 



sup 

0<t<l 



U n (t)-B n (t) q(t) = o P (l), (41) 
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if and only if I(q, c) < oo for all c > 0. Consequently, as n — > oo, 



[/„(•)/?(•) =► 5(0 /?('), on( J D[0,l],P,||/g||) 



(42) 



i/ and only if I(q,c) < oo /or all c > 0, where {B(t),0 < t < 1} is a Brownian bridge. 
Furthermore, as n — > oo, we a/so nave 



i/ and on/?/ if I(q, c) < oo /or some c > 0. 

Remark 4 As compared to Theorem 3, where it is assumed that < a 2 = Eg 2 (Xi) < oo, 
in Theorem 4 we assume only that g(Xi) is in the domain of attraction of the normal law 
and, consequently, we may have a 2 = Eg 2 (X 1 ) = oo, just like in Theorem 2 (cf. Remark 



Remark 5 On taking h(x,y) = x — y, Theorem 4 essentially extends Corollary 2.1.1 of 
Csorgo and Horvath (1997) [cf. Theorem 5.1 in CsSzW (2004)] and rhymes with Theorem 
5.2 and Corollaries 5.1 and 5.2 of CsSzW (2004) [cf. also Theorem 2.1 and Corollaries 2.1 
and 2.2 of CsSzW (2006)], where we study the problem of change in the mean in DAN 
directly via Theorem 2 and Corollaries 3 and 4 of CsSzW (2007), quoting these results 
without proof for the sake of studying the probable error of a change in a mean in the 
domain of attraction of the normal law. In this regard our present Theorems 2 and 4 can 
be viewed in part as extensions of the initial scope of our research in CsSzW (2007) on 
weighted approximations of self-normalized partial sum processes to those of Studentized 
U-statistics type processes with symmetric and antisymmetric kernel functions h(-,-), 
whose respective projections g(Xi) are in DAN. 
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